CCACGCGTCCGCGCGGGAGCTGCTTGGAGGCTCGGCGGCCGGGAGGAGGCCGGGGCCACGCTTCTTGGAA 
GCTACTGAGTGACTTCTTTGAAGAACCATGAAGTCACACTATATTGTGCTAGCTCTAGCCTCCCTGACGT 
TCCTGCTGTGTCTCCCCGTGTCCCAGAGCTGTAACAAAGCACTCTGTGCCAGCGATGTGAGCAAATGCCT
CATTCAGGAGCTCTGCCAGTGCCGGCCTGGAGAAGGGAACTGCCCCTGCTGTAAGGAGTGCATGCTGTGC 
CTCGGGGCCCTGTGGGACGAGTGCTGCGACTGTGTCGGTATGTGCAACCCTCGGAATTACAGCGACACCC 
CGCCCACATCCAAGAGCACCGTGGAGGAGCTGCACGAGCCCATTCCGTCCCTGTTCAGGGCGCTGACGGA 
GGGCGACACCCAGCTGAACTGGAACATCGTCTCCTTCCCTGTGGCAGAGGAGCTGTCACACCATGAAAAC 
CTAGTCTCCTTCCTAGAAACTGTGAACCAGCTGCACCACCAAAACGTGTCTGTTCCCAGCAACAATGTCC
ACGCCCCCTTCCCCAGCGACAAAGAGCGCATGTGCACAGTGGTTTACTTTGATGACTGCATGTCCATCCA
CCAGTGTAAGATATCCTGCGAATCCATGGGTGCATCCAAGTATCGCTGGTTTCACAACGCCTGCTGCGAG 
TGCATCGGTCCAGAGTGCATTGACTATGGGAGTAAAACTGTCAAGTGTATGAACTGCATGTTTTAAAGAG 
GGGGAAGAAATGCAAACCAAAGCAAGAACGCTGACGATGCCGTTGACACTCCTACCTTGCAGGATGTCCC 
CACAGAGTCCACTGGACCAGCTTGTACAAGGGTGAAACCTTTCTGCGTCACCTTTCTGCTTTTGCAGAGC 
CATGACAGCAGCTGCCTTACAGCGTGATTTAGCAGACAGACCTGACAGGTGTGTCATGTGACAGGCAGCC 
ATCAGGGTGGCCTAGCACATTTGTGGGTGCTGATGTTGTATTGCACTGTCAAAGACTGACCAGTCCTGAT 
AGGGCTGGAAAGAGGAAAGGGGTGTGGGAAGGTGACTTTCTTAGCTCTCCCGTAGCAGCACACTGTCATA
GTGTCTCAGTGTCATATTGCTGTGAAGAGACACCACAACCAAGGCAGCTTATGAAGAAAGCCCTTAATTA 
GGGCTCGCTTTACAGTTCCGAGGTTACTTCCTGGTCGTCGTGGTGGGGATGAGGCAGTAGGTAGCAGGCA 
TGGTGCTGGAGCAGTAGCTGAATGTTGGTTCACAAACTTAGCACCTCGAGTCATGATAGAAACACCATCC 
AGGCCTTGAAGCCCAACACGTAGCAACACCACTTGCCAAAGCAGGATCAAGGCTCTGTCAGCGTCCCAGG
ATGCAGTTGGGCAAGTTCTTGAGTGAAATTCCTATACATACTAACTCTGCAATTTTGCTTCTATAGTTCC
TCTTCTTGTTAACTATATGTCCAGAACCATTTCAGAATATGG 

GTTTTTGTGAATAAAAAAAATAAAAAGGCCTGGTCTGCATAGATTGGGCAAGTTCCCTTGCAGCCACAGG 
CTAACAGTAAAGACTTTCTATTTGGCTTTAGGAGTATCCAAGGGGTGGTCTCTCGTGAGAGTACCTCGCA
ACAGCAGCAGATGGTGTTGGAGGCTTGGCCTGCTGTGGGCTTTTGAAACCTTAAAGTCCACCCCCAGTGA 
CACGCCTCCTCCAATAACACCACACCTCCTAATCCTTTCTAAGTAGTCCCTCAACTGGGAACCAAGCATT 
CAGATATACGAGCCCACAAAGGCCATTCTCGTTCAAACCACCACATGTAATAAAATATATGCCACGTCAA
AAAAAAAAAAAAAAAAA (SEQ ID NO: 1)

MKSHYIVLALASLTFLLCLPVSQSCNKALCASDVSKCLIQELCQCRPGEGNCPCC 
KECMLCLGALWDECCDCVGMCNPRNYSDTPPTSKSTVEELHEPIPSLFRALTEG 
DTQLNWNIVSFPVAEELSHHENLVSFLETVNQLHHQNVSVPSNNVHAPFPSDKE 
RMCTVVYFDDCMSIHQCKISCESMGASKYRWFHNACCECIGPECIDYGSKTVKC 
MNCMF (SEQ ID NO: 2)
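The relationship between the two listings above can be spot-checked with a short script. The following is a minimal sketch, not part of the original figure: it assumes that the open reading frame starts at the first ATG of SEQ ID NO: 1 (the figure does not mark the start codon) and that the standard genetic code applies. The names `CODON_TABLE` and `translate_from_first_atg` are illustrative only. Only the first three 70-nt lines of the cDNA are used, so the result is compared against the start of SEQ ID NO: 2 rather than the full protein.

```python
# Standard genetic code, with codons enumerated in T, C, A, G order
# at each of the three positions (assumption: standard code applies).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

# First three 70-nt lines of SEQ ID NO: 1, copied from FIG. 1
# with the OCR spaces removed.
CDNA_START = (
    "CCACGCGTCCGCGCGGGAGCTGCTTGGAGGCTCGGCGGCCGGGAGGAGGCCGGGGCCACGCTTCTTGGAA"
    "GCTACTGAGTGACTTCTTTGAAGAACCATGAAGTCACACTATATTGTGCTAGCTCTAGCCTCCCTGACGT"
    "TCCTGCTGTGTCTCCCCGTGTCCCAGAGCTGTAACAAAGCACTCTGTGCCAGCGATGTGAGCAAATGCCT"
)

# First line of SEQ ID NO: 2, copied from FIG. 1.
PROTEIN_START = "MKSHYIVLALASLTFLLCLPVSQSCNKALCASDVSKCLIQELCQCRPGEGNCPCC"


def translate_from_first_atg(dna):
    """Translate from the first ATG until a stop codon or the end of input."""
    start = dna.find("ATG")
    if start < 0:
        return ""
    protein = []
    # Step through complete codons only; a trailing partial codon is ignored.
    for pos in range(start, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


translated = translate_from_first_atg(CDNA_START)
print(translated)
# True if the translated fragment is a prefix of SEQ ID NO: 2
print(PROTEIN_START.startswith(translated))
```

Under these assumptions the 37 translated residues agree with the beginning of SEQ ID NO: 2. Extending `CDNA_START` with the remaining lines of SEQ ID NO: 1 would allow the same check over the whole coding region.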



FIG. 1 



red underlined = deleted in targeting construct 

green = sequence flanking Neo insert in targeting construct 



CCACGCGTCCGCGCGGGAGCTGCTTGGAGGCTCGGCGGCCGGGAGGAGGCCGGGGCCACG 

CTTCTTGGAAGCTACTGAGTGACTTCTTTGAAGAACCATGAAGTCACACTATATTGTGCT 

AGCTCTAGCCTCCCTGACGTTCCTGCTGTGTCTCCCCGTGTCCCAGAGCTGTAACAAAGC 

ACTCTGTGCCAGCGATGTGAGCAAATGCCTCATTCAGGAGCTCTGCCAGTGCCGGCCTGG 

AGAAGGGAACTGCCCCTGCTGTAAGGAGTGCATGCTGTGCCTCGGGGCCCTGTGGGACGA 

GTGCTGCGACTGTGTCGGTATGTGCAACCCTCGGAATTACAGCGACACCCCGCCCACATC 

CAAGAGCACCGTGGAGGAGCTGCACGAGCCCATTCCGTCCCTGTTCAGGGCGCTGACGGA 

GGGCGACACCCAGCTGAACTGGAACATCGTCTCCTTCCCTGTGGCAGAGGAGCTGTCACA 

CCATGAAAACCTAGTCTCCTTCCTAGAAACTGTGAACCAGCTGCACCACCAAAACGTGTC 

TGTTCCCAGCAACAATGTCCACGCCCCCTTCCCCAGCGACAAAG[AGCGCATGTGCACAGT

GGTTTACTTTGATGACTGCATGTCCATCCACCAGTGTA]AGATATCCTGCGAATCCATGGG

TGCATCCAAGTATCGCTGGTTTCACAACGCCTGCTGCGAGTGCATCG[GTCCAGAGTGCAT

TGACTATGGGAGTAAAACTGTCAAGTGTATGAACTGCATGTTTTAAAGAGGGGGAAGAAA 

TGCAAACCAAAGCA]AGAACGCTGACGATGCCGTTGACACTCCTACCTTGCAGGATGTCCC

CACAGAGTCCACTGGACCAGCTTGTACAAGGGTGAAACCTTTCTGCGTCACCTTTCTGCT 

TTTGCAGAGCCATGACAGCAGCTGCCTTACAGCGTGATTTAGCAGACAGACCTGACAGGT 

GTGTCATGTGACAGGCAGCCATCAGGGTGGCCTAGCACATTTGTGGGTGCTGATGTTGTA 

TTGCACTGTCAAAGACTGACCAGTCCTGATAGGGCTGGAAAGAGGAAAGGGGTGTGGGAA 

GGTGACTTTCTTAGCTCTCCCGTAGCAGCACACTGTCATAGTGTCTCAGTGTCATATTGC 

TGTGAAGAGACACCACAACCAAGGCAGCTTATGAAGAAAGCCCTTAATTAGGGCTCGCTT 

TACAGTTCCGAGGTTACTTCCTGGTCGTCGTGGTGGGGATGAGGCAGTAGGTAGCAGGCA 

TGGTGCTGGAGCAGTAGCTGAATGTTGGTTCACAAACTTAGCACCTCGAGTCATGATAGA 

AACACCATCCAGGCCTTGAAGCCCAACACGTAGCAACACCACTTGCCAAAGCAGGATCAA 

GGCTCTGTCAGCGTCCCAGGATGCAGTTGGGCAAGTTCTTGAGTGAAATTCCTATACATA 

CTAACTCTGCAATTTTGCTTCTATAGTTCCTCTTCTTGTTAACTATATGTCCAGAACCAT 

TTCAGAATATGGGTTTTTGTGAATAAAAAAAATAAAAAGGCCTGGTCTGCATAGATTGGG 

CAAGTTCCCTTGCAGCCACAGGCTAACAGTAAAGACTTTCTATTTGGCTTTAGGAGTATC 

CAAGGGGTGGTCTCTCGTGAGAGTACCTCGCAACAGCAGCAGATGGTGTTGGAGGCTTGG 

CCTGCTGTGGGCTTTTGAAACCTTAAAGTCCACCCCCAGTGACACGCCTCCTCCAATAAC 

ACCACACCTCCTAATCCTTTCTAAGTAGTCCCTCAACTGGGAACCAAGCATTCAGATATA 

CGAGCCCACAAAGGCCATTCTCGTTCAAACCACCACATGTAATAAAATATATGCCACGTC 

AAAAAAAAAAAAAAAAAAA (SEQ ID NO: 1)

Gene Sequence
Structure*



639 bp 



Sequence Deleted 



707 bp 



Size of full-length
cDNA: 1879 bp




Targeting Vector* (genomic sequence) 
Construct Number: 203 



Arm Length: 
5': 1.9 kb
3': 3.6 kb



5' arm 



Neo 

Cassette 



3' arm 




- - - Targeting Vector
----- Endogenous Locus 



* Not drawn to scale 



5'>AACAAATGAAGATCTTTTGTC

CCTCGTTTTTGTCCTGTGCTGATG 

AGCAGTAAGAGGGCTAGAAAGTAA 

CTGCAGGTATTCTCTGAGCAAGCG 

AGCGAGTGGCCGAGCTCTTCCTGC 

CTGTACTGAAATGTCCCTTTGCAT 

TTCAGAGCGCATGTGCACAGTGGT 

TTACTTTGATGACTGCATGTCCAT 

CCACCAGTGTA<3' (SEQ ID NO: 3)



5'>GTCCAGAGTGCATTGACTATG

GGAGTAAAACTGTCAAGTGTATGA 

ACTGCATGTTTTAAAGAGGGGGAA 

GAAATGCAAACCAAAGCAGTAAGT 

CATGAAGTGTGCAGAAATCTTGGT 

TCTGGTATGCTAGGAGTGTGTTAA

GTTATATGATTGTAACTGTGCTTT 

TTATATCTGGTGCCTATTAGTGTA 

GGTCTTTTCCA<3' (SEQ ID NO: 4)
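The colour coding named in the legend is lost in this text rendering, but the square brackets in the annotated copy of SEQ ID NO: 1 appear to mark the sequence flanking the Neo insert. The sketch below is not part of the original figure. It checks that reading by comparing the bracketed spans with the 3' end of SEQ ID NO: 3 and the 5' end of SEQ ID NO: 4, and it extracts the cDNA segment lying between the two brackets. The helper names are hypothetical, and treating the segment between the brackets as the portion of the cDNA removed by the construct is an assumption.

```python
import re

# The five 60-nt lines of the annotated SEQ ID NO: 1 listing that carry
# the brackets, copied from the figure with OCR spaces removed.
ANNOTATED = (
    "TGTTCCCAGCAACAATGTCCACGCCCCCTTCCCCAGCGACAAAG[AGCGCATGTGCACAGT"
    "GGTTTACTTTGATGACTGCATGTCCATCCACCAGTGTA]AGATATCCTGCGAATCCATGGG"
    "TGCATCCAAGTATCGCTGGTTTCACAACGCCTGCTGCGAGTGCATCG[GTCCAGAGTGCAT"
    "TGACTATGGGAGTAAAACTGTCAAGTGTATGAACTGCATGTTTTAAAGAGGGGGAAGAAA"
    "TGCAAACCAAAGCA]AGAACGCTGACGATGCCGTTGACACTCCTACCTTGCAGGATGTCCC"
)

# Last three lines of SEQ ID NO: 3 and first four lines of SEQ ID NO: 4.
SEQ3_END = (
    "TTCAGAGCGCATGTGCACAGTGGT"
    "TTACTTTGATGACTGCATGTCCAT"
    "CCACCAGTGTA"
)
SEQ4_START = (
    "GTCCAGAGTGCATTGACTATG"
    "GGAGTAAAACTGTCAAGTGTATGA"
    "ACTGCATGTTTTAAAGAGGGGGAA"
    "GAAATGCAAACCAAAGCAGTAAGT"
)


def bracketed_segments(annotated):
    """Return the sequences enclosed in [ ] in order of appearance."""
    return re.findall(r"\[([ACGT]+)\]", annotated)


def segment_between_brackets(annotated):
    """Return the sequence between the first ']' and the following '['."""
    match = re.search(r"\]([ACGT]+)\[", annotated)
    return match.group(1) if match else ""


flank_5prime, flank_3prime = bracketed_segments(ANNOTATED)
between = segment_between_brackets(ANNOTATED)

# Each bracketed span should sit at the junction end of its arm sequence.
print(SEQ3_END.endswith(flank_5prime))
print(SEQ4_START.startswith(flank_3prime))
print(len(between), between)
```

With the sequences as transcribed here, both comparisons hold and the segment between the brackets is 69 nt long. The figure states its "Sequence Deleted" size for the genomic sequence, so that value need not equal the cDNA-level length computed in this sketch.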